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Abstract 

This paper describes the definition of maximum likehhood equivalent estimators 
of the Trihnear Gauge Couphngs which include detector effects. The asymptotic 
properties of these estimators as well as their unbiasedness and efficiency when 
dealing with finite statistical samples are demonstrated by Monte Carlo experimen- 
tation, using simulated events corresponding to the reaction e"'"e~ — > qqlv at 172 
GeV. Emphasis is given to the determination of the expected efficiencies in extract- 
ing the a\Y(j,, otw and a^,^ couplings from LEPII data, which in this particular case 
found to be close to the maximum possible. 



1 Introduction 



It has been often emphasized that the hkehhood estimators can reach the lower bound of 
the Cramer-Rao ^ inequahty, 



.OHnL , , -1 



v(A) > i-Ei^),.-r > m 



where: 

A is the hkehhood estimation of a parameter A 
L is the hkehhood function 

V and E are the variance and expected value operators respectively 
Xx denotes the maximum available information with respect to A. 

The right hand side bound depends on the construction of the probability density function 
(p.d.f.). Thus, as has been shown in and P], in order to reach the maximum accuracy in 
estimating the Trilinear Gauge Couplings, the complete kinematical information has to be 
included in L. However, when the measured kinematic vectors are used, the experimental 
uncertainties must be explicitly taken into account in the p.d.f. formulation. In other 
words, the data must be compared with the distribution: 

P(fi; A) = y" g{V; A) • R{V, n)dV (2) 



rather than with 



where 



(7obs.{^) 



V = {Vi, . . . ,Vm} and Q = {Qi, . . . ,Qrn} are the true and measured m-dimensional 
kinematic vectors respectively. 

A is the set of p physics parameters {Ai, . . . , Ap} which are needed to define completely 
the p.d.f. 

^{V) is the selection efficiency function taking values continuously between zero and one, 

daiV; X)/dV is the differential cross section. 

o'obs.W is the total observed cross section defined as[|: 

(Tobs.W = [ {da{V; X)/dV) ■ e{V)dV (4) 



R{V,Q) is the resolution function, or else the probability for the true kinematic vector 
V of an event to be measured as Q. 



^Throughout this paper the integrations are meant to be over the whole phase space. 
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g{V; A) is the p.d.f. according to which the vectors V are distributed and 
P(i1; A) is the p.d.f. according to which the vectors (1 are distributed. 

The use of (0), in muhi- dimensional phase space, faces several difficulties of which 
the parameterization of the resolution functions is the most serious. The often offered 
remedies consist of either using @ instead of with the hope that possible biases will 
be determined afterwards by Monte Carlo (M.C.) experimentation or of integrating over 
some of the kinematic variables and paramererizing the resolution with respect to the 
remaining fewer kinematic components. 

This paper concentrates on the definition of likelihood equivalent estimators which, 
whilst including the description of all the detector effects, retain the maximum likelihood 
sensitivity in extracting the values of the Trilinear Gauge Couplings. The next Section 
briefly describes the topological and kinematic features of the data samples used in this 
analysis. In Section 3, ways of reducing the number of the necessary kinematic variables 
are discussed when the detector effects are negligible. In Section 4 the results obtained 
in the ideal case are extrapolated to the case when the detector effects are important. A 
demonstration of the optimal properties of the proposed techniques is given at Section 
5 by Monte Carlo experimentation. Finally, Section 6 compiles the conclusions of this 
analysis. 

2 The Four-Fermion Semileptonic Final States 

One of the important physics issues of the LEPII program is the determination of the 
non-Abelian self couplings ( Trilinear Gauge Couplings, in the following TGC's) of the 
carriers of the electroweak interaction. The measurement of the WWV couplings, where 
W stands for the W gauge boson whilst V denotes the 7 or neutral carriers, is possible 
via WW production [Q] through diagrams involving exchange of Z° and 7. Limiting the 
phenomenological analysis to the contribution of gauge invariant operators of dimension 
less than six 0, there are 3 CP conserving {aw^, «iy and ascp) and 2 CP violating {asw 
and dw) couplings which by deviating from their Standard Model values, could point to 
the existence of new physics. 

Several final state topologies, corresponding to the production of a pair or of a single 
W and to the different W decay modes, can be used to determine the TGC's. In this paper 
only final states including a charged lepton, two hadronic jets and an invisible neutrino 
are considered. These final states, in the following semileptonic final states, suffer less 
from kinematic ambiguities and consequently a more precise measurement can be made. 

However, W pair production diagrams alone are not sufficient to express the pro- 
duction dynamics of the semileptonic states 0]. Throughout this paper full four-fermion 
phenomenological models including Coulomb corrections and Initial State Radiation (ISR) 
0,0 are used to describe the kinematic distributions and/or to produce Monte Carlo 
events as functions of the CP conserving couplings. 

The detector simulation program DELSIM was used to study the effect of the 
distortion of the kinematic distributions, due to the event selection efficiency and the 
detector resolution, on the measurement of the TGC's. Special care was taken in selecting 
the semileptonic four fermion final state topologies in order to avoid contamination from 
products irrelevant to the determination of the TGCs. The selection criteria, based on 
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the lepton identification jet reconstruction and missing visible energy, are discussed in 
details in P]. An improvement in the measurement accuracy of the kinematic vectors was 
achieved by a six-constraint kinematic fit. 

After the constrained fit, all the eight kinematic variables needed to specify an event 
completely were determined up to a degeneracy in the charge of the hadronic jets. The 
latter resulted in a loss of information by averaging the squared matrix elements corre- 
sponding to the two momentum assignments @ . 



3 The Ideal Case 

The differential cross section is parameterized as a quadratic function of the TGC's of 
the form: 

daiy- X)/dV = coiy) + E (^) ■ Afe + E (V") ■ ■ (5) 

k fj,,k 

where the vector V is of dimension eight and A = {Ai, . . . , Ap} is the set of the TGC's 
which are allowed to deviate from their Standard Model (S.M.) values. Following eq. @, 
the total cross section can be expressed as: 

^tot(A) = 5o + E '^f • Afc + E St -X.-Xk (6) 

k fik 

si = j4iy)dv (7) 

When the detector effects are negligible the p.d.f. takes the form: 

^ co(^) + c\{V) ■ Afc + E,,k -X.-X, 
'S'o + J2k Si ■ Xk + J2fj.k ■ A^j • Afc 

and the likelihood function for a set of N observed events with kinematic vectors Vn, 
n = 1, . . . , N is defined as: 

N 

L{Vu...,VN;X) = l[giVn,X) (9) 

n=l 

3.1 Optimal Variables 

The kinematic distribution with respect to a subset of the components of V, let's say 
X = {Vi, . . . , Vu} with u < m, is found by integrating (||) as: 

w{qi, ...,q,;X)=J g{V; A) ■ 5{qi - V^) ■ ■ ■ 5{q, - V,)dVi ■ ■ ■ dV, ■ ■ ■ dV^ (10) 

and the corresponding likelihood function for N events is: 

N 

L(Xi,...,X^;A) = n^(^n;A) (11) 

ra=l 



^ In the 6c kinematical fit the total four-momentum vector was constrained to (0,0,0,172 GeV) and 
the invariant masses of the hadronic and the leptonic system were required to be equal to 80.35 GeV/c^. 
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The likelihood estimators of the couplings A are defined as the solutions of the following 
system of equations : 

^ ~ ' ~ — ^]a=A ■ -^("^1' • • • ' -^N] A = Xtrue)dXi ■ ■ ■ dXjsf = (12) 



or equivalently M 



.d\nuj{X;X). 

9a7 



N ■ I [ ^^^]X=X ■ ^(^; A = Xtrue)dX = (13) 



where Xtme, X are the true and the estimated values of the couplings respectively. If a 
single set of N observed events is available eq. (|13]) is approximated as: 



glnL(Xi,...,X^;A) , ^ dlnw{Xk;X) ^ _ 

dX, ^'='~h 9X, J^=^"^ ^ ' 

The maximum likelihood estimation (0) satisfies at the asymptotic limit the left 
side bound of ([^). However, the right side bound of the Cramer- Rao inequality is not 
necessarily satisfied after projecting the p.d.f. Indeed, in and traditional kinematic 
distributions such as the W production angle, the charged lepton angle with respect to 
the W direction in the W rest frame etc. have been rated according to their efficiency 
in estimating the TGC's and they have been found inferior to the eight-fold unbinned 
likelihood fit. 

Searching for a subset of the kinematic components which contain all the available 
information with respect to A, one has to start from the error matrix of the likelihood 
estimators when all the kinematic variables have been used, as in (§) and (|^). In the 
asymptotic limit this matrix has elements given by (|l]), i.e.: 

-1 ,/-a^ln9(t^;A = A)_^„,.x_X_,^^^),^]_, 



N 7 dXidXj 

— * 

The second derivative in (|T^) depends on the kinematic vectors V through terms such 
as Ci{V) / co(y) and {V) / co{V) . This has significant imphcations especially when only 
one parameter TGC models are used. In this case the p.d.f. is written as: 

^^^'^^ = — So + S,-X + S,.X^ — 

— * — » — * 

where the terms Co{V), Ci{V) and C2{V) correspond to the particular choice of the TGC 
A. 

The error in the maximum likelihood estimation of A will be: 

y,^^, ^ 2-S2-{So + s^-x + S2-x')-is, + 2-S2- xy 

N {So + Si-X + S2- A2)2 

2 • Q2{V) ■{1 + Q,{V)-X + Q2{V) ■ A2) - {Q,{V) + 2 • Q2{V) ■ A)^ 



+ 



(l + gi(F)-A + Q2(V^)-A2)2 
■giV;Xtrue)dV]-' (17) 
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where 



Q.(V) = (18) 



co{V) 

However, when the hkehhood function is expressed in terms of the projected proba- 
bihty distribution: 

w{q,, g2; X)=J g{V; A) ■ - Q,{V)) ■ 5^ - Q2{V))dV (20) 
the variance of the estimated couphng will be: 

-1 . 2-S2-{So + Si-X + S2-X^)~iSi + 2-S2-Xf 



V{X) 



N ^So + Si-x + S2-x^y 

2 ■ g2 • (1 + gi • A + g2 • A^) - (gi + 2 ■ g2 • A)^ 
(1 + gi ■ A + g2 ■ A2)2 
■9{V; Xtrue) ■ 5(gi - Qi{V)) ■ 5{q2 - Q2{V))}dVdq^dq2]-' (21) 



Reversing the order of integration in (pi]) and integrating first with respect to qi and 



g2 it is easy to see that the achieved accuracy by employing the projected distribution 



is the same as when the complete mult i- dimensional distribution (16) is used 



3.2 The Optimal Observables 

The reduction of the number of necessary kinematic components is not so dramatic when 
more than one TGC are to be simultaneously extracted from the dataQ. There is, though, 
the possibility of a further reduction in the number of the necessary kinematic variables 
by expanding the p.d.f. in a Taylor series and keeping only the linear terms. Returning 
to the general case of estimating simultaneously p couplings, the distribution function (||) 
is approximated in the neighborhood of the expansion point A° as: 



,(V.J)^yjmM-,tii^-'-^).K,] (22) 



where 



yo{V; A°) = coiV) + ^ C'AV) ■ A° + ^ (^) ■ Aj ■ A° (23) 

k fj,k 



y'l{V;X') = c'i{V)+Y.CtiV)-Xl (24) 



As an example, in a two couplings model the eight kinematic variables can be reduced to five. 
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= / y',dv (25) 



and Ai. = At — A° 



k — — ^k- 
Furthermore, by projecting (p2D as: 



z^(Oi(A°),...,0,(A°);A)= (26) 



g{V; A) ■ <5(0i(A'^) - ^^i^-^) ■ ■ ■ 6{0,iX') - '-^^^^)dV (27) 
2/o(^; AO) VoiV] AO) 

the number of the necessary kinematic variables are reduced to as many as the number of 
parameters to be fitted simuhaneously without losing information. It has been also shown 
in P] that there are simple statistics equivalent to the maximum likelihood estimators. 
These are the mean values of 0^(1^, A") , called Optimal Observables, defined as: 

yQiV; A") 

which are linearly related to Aj(= Aj — A°) as: 
(0,(y;A°))3^ = (Ofc(\/;A°))xo 

+ jzmv- A°) ■ Ouiv- A°))xo - mv- A°)),^o ■ mv- a°)),^o] ■ a, (29) 



where the operation ($fc(V^; A°))^ denotes the convolution: 

{MV; A°))x = / ^kiV- A°) ■ g{y- \)dV (30) 



and the symbols with hats stand for the estimated quantities. Thus, around the initial 
value A°, the right hand side of ( ^9]) can be easily evaluated as a function of the values 
of the couplings (e.g. by using the four fermion M. C. generators to produce events at A° 
couplings), whilst the left hand side of ( |2^ ) is estimated by using the N kinematic vectors 
Vn, n = 1, . . . , N measured by the experiment as: 

/ 0,(K; A°) ■ g{V- X'^^dV ^ - ^ Ofe(K; A°) (31) 

n=l 

The couplings values A = {Ai, . . . , Ap}, are then estimated by solving the linear system (of 
p equations with p unknowns) defined by (^) and (pi]). Similarly the error in this esti- 
mation is evaluated in terms of the the variance of the experimental measured quantities 



It must be emphasized that, due to the approximation (|22|), this estimation is con- 
sistent only when the true coupling values are close enough to the expansion point. In 
the asymptotic limit, and when the above condition holds, the estimation based on the 
mean value of the optimal observables has the same0 efficiency as the maximum likeli- 
hood. However, when dealing with data sets of finite statistical size, special care must be 



*For a rather simple proof see at the Appendix A 
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taken to ensure that the approximation is vahd for the region of the couphngs values 
which correspond to mean values of Optimal Observables ( via eq. ^) lying within the 
uncertainty of the measured quantity (|3TD . 

In the case of small data sets, an increase in the estimating efficiency is expected by 
extending the definition of the likelihood function to count for the total number of the 
observed events. The extended likelihood (L*^^*) is defined as: 

L-\V,, ...,V^;X) = n 9iV. A) (32) 

where fi{X) is the expected number of events for a luminosity C which is dependent 
quadratically on the coupling values through equation (^. The extended maximum like- 
lihood equivalent estimators are formed ([^ and Appendix A) as the products of the mean 
values of the Optimal Observables and the expected number of observed events. The es- 
timation is then performed by solving the non linear system of the following equations: 

N 

J20kiV^;\') = fiC\)-[{0k{V; A°))ao 

n=l 

+ t.m{V- A°) ■ 0,(V; A°)),o - A°)),o ■ (0,(V; A°)),o] ■ A, (33) 

i=l 

/i(A) =C-[So + Y.Si-^k + Yl ■ A^Afc] 

k nk 

where, as in (pT]), the left hand side corresponds to the experimental measurement whilst 
the right hand side is evaluated as a function of the coupling values using the phenomeno- 
logical models. 



4 The Realistic Case 

When the detector effects are sizable and cannot be ignored, the p.d.f. which contains 
the whole available information is written as: 

P((], A) = / ^°^^^ + ^^'^'^^^-^^ + ^^'^;f^^^-^^-^^ . e{V) . RiV, n)dV (34) 
©0 + Efc ©1 ■ Afc + J2f_ik ©2 ■ A^ ■ Afc 

where 



Qi= ciiV)-eiV)dV (35) 



4.1 Optimal Variables Including Detector Effects 

Using the notation: 

~4m = j4{V)-e{V)-R{V,Q)dV (36) 



the p.d.f. (M) is written as: 

^ go(^) + Ek c'M ■ Afc + E,,. 5f (^) • Afc ■ A^ ^^^^ 
©0 + Sfc ©1 ■ Afc + J2iik ©2'^ ■ A^ ■ Afe 
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which retain the functional form of Consequently, in the case of a single TGC 

parameter model, the two directions of the phase space on which the projected p.d.f. 
retains the whole information will be: 

cm - f^^(V-<v-R(y^^)dy (38) 

JcoiV)-eiV)-RiV,n)dV 

. ^ Jc.iVj.eiXn-RivAdV 
Jco{V)-e{V)-R{V,Q)dV 
Equation (^) and ( |39D can be rewritten in the general form: 

Ci,2(^^) = / QidV) ■ DiV, Q)dV (40) 



where the definitions of (p!8D and (|T9|) have been used and D(y, Q) stands for the following 
expression: 

Div n) = M^)/Qo)-<^)-^(^-^) (41) 

Jico{V)/eo)-e{V)-RiV,n)dV 

The positive function D{V,Q), is less or equal to one and normalized to unity 
(/ D{V, Q)dV = 1). It expresses the conditional probability that: the kinematic vectors V 
generated with the p.d.f. co(F)/6o (i.e. with coupling equal to zero) will be observed as Q. 
Consequently the variables Ci,2(^) can been seen as the mean values of Qi^2{V) with the 
condition that the observed kinematic vectors be equal to Q. This interpretation of (|40|) 
suggests a pre-analysis stage, during which d and (2 will be evaluated as functions of Q 
by M.C. integration. However, the size of the necessary sample of M. C. events increases 
exponentially with the dimensionality of the phase space and thus, dealing with eight di- 
mensional phase space, it makes this pre-analysis impractical. The approach followed in 
this work, was to project the resolution function RiV, Q) on the Ci{Q) / Co{Q) , C2(fi)/co(fi) 
plane and approximate the variables Ci,2(^) as: 

Ci,2m ^ ^i,2(xi, X2) = J Qi,2iV) ■ D{V, xi, X2)dV (42) 

D{v,xi,x2) = J D{v,n)-s{xi-ci{n)/co{n))-s{x2-c2{n)/co{n))dn (43) 

where D{V,Xi,X2) is the the conditional probability that: the kinematic vectors V, gen- 
erated with the p.d.f. co(l^)/6o and observed as fl correspond to values of ci{Q) / co{Q) 
and C2(fi)/co(fi) equal to xi and X2 respectively. Figure |1| (and figure |^) show the de- 
pendence of zi{xi,X2) (-22(3^1,3^2)) on ci{Q)/co{Q) (c2(fi)/co(fi)) in different regions of 
C2(0)/co(r2) {ci{fl) / co{fi)) when the aw<f> TGC model is considered. For the evaluation of 
these quantities a M.C. set of events WW eueqq, produced with the EXCALIBUR @] 
four fermion generator with Standard Model (zero) couplings and passed through the full 
detector simulation program DELSIM 0, was used. Then the values of zi^2{xi,X2) which 
correspond to a region B of [ci{Q) / co{fl) , C2{fi) / co{Q)] plane were estimated according to 
(H) as 

1 "S 

^i,2(a;i,X2)~— EQi,2(V^.) (44) 
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where the reconstructed kinematic vectors fij of the M.C. events used in (Q) were 
such that ci{fli) / co{Qi) , C2{fli) / co{Qi) belongs to B. 

The straight hues in these figures indicate the region of equahty between the plotted 
quantities. The fact that the two quantities almost coincide^ suggests that the two direc- 
tions of the phase space on which the p.d.f. ( ^3]) retains the major part of the information 
when projected, can be further approximated by: 

C,.(n) '-i^ (45) 

4.2 Modified Observables Including Detector Effects 

By expanding the probability distribution (^4|) around A*^ and following the same argu- 
ments as in the previous section one finds that the mean values of the quantities 

Im(v-,\'')-4V)-R{v,n)dv 

— * 

have the same estimating efficiency as the unbinned likelihood for coupling values A close 
to the expansion point. These Optimal Observables ujk{^] A") which include the detector 
effects can be expressed as mean values under conditions, of the Optimal Observables 
OkiV; A°) defined in the ideal case. This can been seen by rewriting (^61) as: 

M^; A°) = / ■ B{v, n)dv (47) 



with 



and 



' J{y'o{V;X')/Zo{X'))-e{V)-R{V,n)dV 
^o(A°) = / yo{V; A°) ■ eiV)dV (49) 



where the conditional probability B{V; Q) expresses the probability that: the kinematic 
vectors V produced with p.d.f. ?/q(1^; A°)/Zo(A'') (i.e. with coupling values equal to A°j 
be observed as Q. The expected values of ujk{^', A°) for coupling values A close to the 
expansion point A" are linearly dependent on A as in (^9]). Namely: 



i=l 

- (u;,(f];A°))x,,-(^.(^l;A°)),^J-A, (50) 
where the brackets stand for the operation: 

(M^; A°))x = / M^; a°) ■ p{^; X)dQ (5i) 

^Similar results have been obtained with the aw and cross section parameterization. 
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The evaluation of the functional form of the Optimal Observable (^) in a preanalysis 
stage faces practical limitations, due to the necessary number of M.C. events when dealing 
with eight dimensional phase space. In this analysis, by projecting the resolution function 
on the A°)/?/Q(f2; A°) axis, equation ( ^71 ) is approximated as: 



A°) ~ afc(0; A°) = j 0,(V; A^) ■ BiV, (l>)dV (52) 

B{vJ) = J Biv,n) ■ fl6i(t>k- y'A^;\')/yoi^;X'))dn (53) 



fc=i 

0={0i,...,0,} (54) 

where B{V, (p; A°) is the conditional probability that: the kinematic vectors V gen- 
erated with the p.d.f. ?/o(^; A°)/Zo(A°) and observed as Q correspond to values of 
?/i (f2; A°/?/o(^; A°) which are equal to 0^. In figure ^, M.C. semileptonic events with 
an electron in the final state generated with the EXCALIBUR four fermion generator 
and with Standard Model couplings, having passed through the full detector simulation, 
are used to demonstrate that for several initial values of the coupling the avi/0 Optimal 
Observables can be further approximated^ as: 

a;,(n;A°)^^^S^ (55) 

Both approximations (|52D and (|55|) respect the linear dependence of the mean values of 
the Optimal Observables on the true coupling values. However, the observables (|55|)fl are 
not exactly optimal, in the sense that their mean values do not carry in principle the same 
information as the unbinned likelihood function. The consistency and the unbiasedness, 
though, of these estimators are guaranteed by the inclusion of all the detector effects. 

An estimation, based on the mean values of the Modified Observables, inherits the 
limitations of the Optimal Observables described in the previous section. Namely the 
linear dependence is valid only around the expansion point. In this paper we propose the 
evaluation of the mean of the Modified Observable: 



{y'[{n;X'))/{yo{n;X')) ■ P{n;X)dn 



for any value A by means of M.C. integration employing the reweighting technique ||T0| . 



This procedure is valid for any range of coupling values and consequently guarantees con- 
sistent estimation independently of the particular expansion point. The above arguments 
are demonstrated in figure ^ and ^. A collection of M.C. events, consisted from several 
sets produced at several couplings and passed through the full detector simulation^], had 
been used to evaluate, by reweighting, the dependence (in the following the calibration 
curve) of the mean values of the Modified Observables for electronic final states on the 
aw^ coupling for several {a^,^ = -2,0 and 2) expansion points. The points, in these fig- 
ures, represent the calibration curves whilst the straight lines in figure ^ correspond to the 
linear equation (pOf). It is, though, obvious that the consistency of an estimation based 



^ This found to be a good approximation for the aw and aBtj> Observables as weh. 

^ In the following Modified Observables 

^More details on these M.C. sets are given in Section 4. 
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on the linear relation (^Dj), as proposed in depends strongly on the expansion point. 
However, when the calibration curve is used, the estimation is consistent independently 
of the expansion point, as is shown in figure ^ where a set of a M.C electronic events 
have been used as a data sample. These events were generated with the EXCALIBUR 
four fermion generator with Standard Model couplings and passed through a full detector 
simulation. The mean values of the Modified Observables evaluated using the data sample 
as follows: 

are shown in figures |^a), ^d) and ^) as the central horizontal lines whilst the band around 
the central line corresponds to the statistical error of (|56|). Although the calibration 
curves coincide with the measured averages at the true coupling value independently of 
the expansion point, the error in the coupling estimation varies, ^ reaching its minimum 
at the expansion point which equals the true coupling value. 

The extended Modified Observable estimators which take into account the expected 
event multiplicity are built in the same way as in the ideal case, i.e. as the product: 

/i(A) ■ j uJkiCl- A°) ■ Pin- x)dn 

The calibration functions 

jr^(A; A°) = ^(A) • J ujkiQ; A°) • P{Q; X)dQ (57) 

are evaluated as before by reweighted M.C. integration and the coupling estimates are 
the solutions of the following system of p equations with p unknowns: 

j- yi^'yy = j-,(A;a°) (58) 

n=l Voi^n, A°) 

Since the estimation efficiency becomes optimal only when A ^ A°, the estimation being 
consistent for each A°, an iterative procedure can be followed which will converge to the 
above optimality condition. To demonstrate the convergence properties of this algorithm, 
three M.C. samples of 1000 WW — >• eu^qq events each, passed through detector simu- 
lation and generated with the EXCALIBUR four fermion generator with aw<j> coupling 
values equal to -2,0 and +2 respectively, were fitted by solving equation ([58| ) for several 
values of the expansion point (a^^^). After every fit, the difference between the estimated 
value of the coupling and the expansion value defines the step of the iterative pro- 
cedure. Convergence is achieved at the expansion point where the step equals to zero. 
As it is shown in figure ^, convergence is achieved after few only (two) iterations almost 
independently of the initial values. 

In the general case, where the data sample consists of a collection of the three semilep- 
tonic channels (muonic, electronic and tau), the p.d.f. is the following weighted sum of 
the individual distributions: 

P{n;X) = j^WfiX)- f gf{V;X)-Rf{V,n)dV 
f=i 

^In this example, for N ~ 1000, the errors were 0.1,0.07 and 0.1 for a^j, -2, and +2 respectively. 
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Pfin;X) = j gf{V-X)-Rf{V,Vi)dV 



^/(A) = 3 f .r^ (59) 

where the subscript / stands for the flavor of the final state lepton. Then the extended 
Modified Observable estimators are also defined as weighted sums of the form: 

3 3 

/=i /=i 

where M(A) stands for the expected number of the events in total, whilst /i/(A) denotes 
the expected number of events in the semileptonic channel /. In practice one has to 
evaluate three sets of calibration curves J-'kj{X, A"), one for each channel, and to use the 
A^(= Z]/=i Nf) selected events to solve the following system of equations: 

3 ^/ (n . \0\ 3 

EE^^^ = E-^^./(A;A°) (61) 

/=1 n=l Z/0,/l"n; > f=l 

5 Numerical Results 

In the previous section we proposed two strategies which include the detector effects in 
efficient estimators. In principle their asymptotic efficiency is less than the unbinned 



eight-dimensional likelihood estimator due to the projections (^) and (|52D . There is 
not a simple way to quantify the loss in information due to the fact that an unbinned 
maximum likelihood in eight dimensions estimator which includes the detector effects is 
practically impossible to build. Therefore, it makes more sense to discuss in detail the use 
and properties of these estimators in extracting the TGC's especially when one deals with 
small data samples such as the available data samples at 172 GeV centre of mass energy 
at LEPII. The examples given bellow concern one parameter fits of different sensitivities, 
where the estimation of the awct>, «iy and couplings have been chosen for this purpose. 

In all the following demonstrations the integrations performed by M.C techniques 
made use of the reweighting procedures to express accurately integrals as functions of the 
TGC's. Several M.C. samples, generated either with the PYTHIA [|12| (including only 
the resonant graphs, and ISR) or the EXCALIBUR (including the full set of four fermion 
graphs and ISR) generators at different coupling values and having passed through the 
detector simulation program DELSIM, were reweighted to correspond to the full set of 
four fermion diagrams with Coulomb corrections. These samples have been combined as 



in |]TU|, in order to increase the statistical accuracy of the reweighted M.C. integrations. In 
parallel other M.C. set of events, produced with the EXCALIBUR four fermion generator 
at certain coupling values, played the role of data sets after being passed through DELSIM. 
The selection criteria, applied to all the M.C. events, were those described in Section 2. 
The event multiplicity of each of the M.C. data samples was chosen according to Poissonian 
distributions with mean values corresponding to the expected number of events after 
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reconstruction with integrated luminosity ~ 10 pb^^ (a typical integrated luminosity 
received by the LEP experiments at ^/s = 172 GeV). As an example, the data sets 
which were produced with Standard Model couplings consist of n^ + rie + rir events, where 
the subscript denotes the lepton flavor in the final state and the n^,nf.,nT- multiplicities 
varied from set to set following Poissonian distributions with means 15.35, 12.5 and 5.64 
respectively. 



5.1 Binned Likelihood Fits 

The central assumption in this paper is that the resolution and the selection efficiency 
can not be expressed easily as functions of eight kinematic variables. Consequently the 
projected p.d.f. in the Ci(^)) C2(^) plane (|^) cannot be expressed analytically. Its nu- 
merical evaluation is, though, possible using M.C events. There are several techniques 
| rT| of functional interpolation but this work followed the simplest, namely, the determi- 
nation of the projected distribution in bins of Ci and ^2- Then the extended likelihood 
function, when N events are observed, distributed as rifc (fc = 1, . . . 3 ■ /3) events (including 
background contribution) in the k^^ {Ci!C2} bin, is written as: 



1 r r ixi. + , X (^fc-Mfc(A))^ ivk-bkf 

L^^* = n ^TT^ / / V e-(-^+^^)e"^^^e'^^dx, ■ dy, (62) 

Each of the f3 consecutive terms in this product belongs to one of the three semileptonic 
final states, whilst /ifc(A) (6^) and (Jk{X) (cbk) are the expected number of signal (back- 
ground) events in the k^^ bin and the gaussian error respectively. The evaluation of /ifc(A) 
and its error as a function of the couplings is done by reweighted M.C. integration where 
the detector effects and the contamination of each of the final state channels from each 
other have been taken into account. A technical detail, worth mentioning, is the fact that 
the calculation of the coordinates Ci,2 from the observed vector Q is solely based on the 
Matrix Elements because other factors, such as the phase space and that expressing the 
initial state radiation, cancel out in (^51). 



To demonstrate the statistical properties of this technique, 210 M.C. data samples 
at Standard Model Couplings and without background contribution were fitted. The 
produced distributions of the estimated aw(f„(ywj<^B<f> in one-TGC model fits, are shown 
in figures |^, ^ and ^a. These distributions are found to be in excellent agreement 
with Gaussians of means (0.01 ± 0.02, 0.007 ± 0.04 and 0.07 ± 0.1) consistent with the 
true coupling values and sigmas equal to 0.34 ± 0.02,0.6 ± 0.03,1.25 ± 0.05 respectively. 
Furthermore the pull distributions^, shown in |^, and ^ are found to be normal with 
sigmas (0.94 ± 0.05, 1.02 ± 0.06 and 0.97 ib 0.05) consistent with unity, which indicates 
that the errors of the estimations are correctly evaluated. This is also supported by the 
very good agreement of the sigma of the distribution of the estimations and the mean of 
the distribution of the estimated error in each individual fit (0.34 ± 0.04, 0.58 ± 0.05 and 
1.28 ± 0.02). Similar tests performed with 20 sets produced with aw4, values at -2, and 2 
demonstrated the same properties. 

lOWhich is the distribution of the deviation of each individual estimation from the true coupling 
normalized to the estimated error. 

At awci> — ~2 the average of the estimations are —1.99 ± 0.06 whilst the root mean squared pull is 
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5.2 Modified Observable Fits 



An estimation of the couplings based on the extended Modified Observable technique is 
expected to be at least equally efficient as the binned extended likelihood estimation of 
the previous section. Furthermore, the Modified Observable fits do not split the events 
into bins and in principle they suffer from less systematic error due to the M.C statistics. 
However, as has been emphasized in the previous sections, the optimal efficiency of this 
technique is achieved when the uncertainty on the measurement of the mean of the Mod- 
iffed Observables lies within the linear part of the calibration curve around the expansion 
point. 

The calibration curve (^(A; A")) and its statistical error(£^(A; A")) at the A° expansion 
point are evaluated by reweighted M.C. integration as functions of the fftted coupling. 
In the general case, taking also into account the statistical errors due to the ffnite M.C. 
statistics, the coupling value is extracted by maximizing the following likelihood function: 

L{X; Ao) = ^ = ■ e^(^^^») (63) 

v/2-7r-(aJ(Ao)+a,2(Ao)+£:2(A;Ao)) 

where 

{[N ■ {jjjn; \o))data - m ■ {ivin; Ao))b«cfc] - ^(A; Ap))^ 
^ ' " 2.(aKAo)+a,^(Ao)+^^(A;Ao)) ^''^ 

and N is the number of selected events with measured vectors {Qi, . . .Q^}, Nb is the 
number of background events expected in the data sample and <Jd{\o) is the measurement 
error on the quantity N ■ {uj{Q] X^))data which is evaluated from the data as: 

N ■ {coin; Ao))..,. ^ E E -^§44 (65) 

f=ln=l Z/Ojliin; Aoj 

The background term and its error, o"fe(Ao) are calculated by using a set of M background 
M.C. events as: 

M fr[ yoiik, Ao) 

As discussed in the previous sections, an iterative procedure has to be followed until the 
value of the coupling which maximizes (0) coincides with the expansion point. 

As in the case of the binned likelihood estimation, the 210 M.C. sets produced with 
Standard Model couplings and without background contamination were fitted to deter- 
mine the a\Y(f„ aw and couplings. The results are shown in Figures 0, [TT| and |T2 



where Gaussian fits have been performed to the estimation, pull and estimation error 
distribution. The technique exhibits the desired properties at least for the case of the 
aw(f) and aw fits, that is the estimation distributions are found consistent with Gaus- 
sians with means at {aw(t>) = —0.01 ± 0.02 and (aw) = 0.05 ± 0.05 (which indicates the 
unbiasedness of the estimation) and sigmas CTaw,/, = 0-33 ± 0.02 and aay^^ = 0.56 ± 0.03 
which are in excellent agreement with the mean of the distribution of the estimated errors 
in each individual fit, 0.33 ± 0.04 and 0.58 ± 0.01 respectively. The correct estimation 

0.96 ± 0.16. Similar tests at aw4> = +2 gave as mean of the estimations 1.98 ± 0.06 with a root mean 
squared of the pulls 1.007 ± 0.160. 
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of the errors in each fit is also demonstrated by the fact that the pull distributions are 
normal with sigmas consistent with unity (1.05 ± 0.07 for the aw<t> and 0.95 ± 0.06 for 
the aw fits). The same properties have been found in estimating aw(t> by using the 20 
sets of events produced at -2 and +2 coupling values 0. However, the fits to extract the 
coupling make apparent that there is a limitation to this technique. Although the 
distribution of the estimations has a Gaussian shape with mean consistent with the true 
coupling (0.08 ±0.10) and a sigma equal to 1.25 ±0.06, the pull distribution deviates from 
a Gaussian shape due to an excess of values at pull values around zero. Furthermore the 
distribution of the estimated errors consists of a gaussian peak centered around 1.3 and of 
a broad shoulder at higher values which indicates that for a fraction of the fits the error 
was overestimated. This is a direct consequence of a large statistical error in evaluating 
the experimental quantity (|65D compared to the region in which the approximation (|22|) 
is valid 0. 

5.3 Comparison of Fitting Techniques 

In order to quantify the loss in precision due to the approximations (^2|) and (|5^), the 210 
M.C. samples of events used to investigate the properties of the fitting techniques proposed 
in this paper were used also in unbinned extended likelihood fits by using either the true 
kinematic vectors of the events (in the following perfect extended likelihood fits) or the 
reconstructed kinematic vectors after the event has been treated in a 6c constrained fit 
(in the following 6c unbinned extended likelihood fits). The results of these fits as well as 
the results of the binned likelihood and the extended Modified Observables fits described 
in the previous sections are summarized in Table |I|. Although the available statistical 
accuracy provided by this M.C. experimentation with only 210 samples is not enough for 
strong statements, it can be seen that the 6c unbinned extended likelihood fits, suffer from 
biases in estimating the a^^ and a^j couplings. These biases are eliminated by application 
of our proposed techniques. 

Since the p.d.f (H) carries less information than the p.d.f (|]), it is to be expected 
that the precisions obtained by the binned 2-dimensional likelihood and the extended 
Modified Observable methods should be somewhat poorer than those from the perfect 
EML method. The statistics used for the results of Table |l] are insufficient to demonstrate 
this conclusively; however, it is clear that the loss of information is very moderate. 



6 Conclusions 

In this paper it has been shown that the reduction of the necessary kinematic variables 
in the estimation of TGCs is possible without loosing accuracy. This projection makes 
it possible to include the detector effects in a binned extended likelihood estimation by 



employing the reweighting |]T0| M.C. integration technique. Although this projection is 
useful only in one TGC parameter fits, an extension of the Optimal Observable technique 
0] proposed in this work can further reduce the necessary kinematic parameters to as 
many Modified Observables as the number of the TGCs which are fitted simultaneously 



The mean values of the estmiations are —1.99 ±0.02 and 2.024 ±0.022 whilst the root mean squared 
pulls are 0.9 ± 0.15 and 0.88 ± 0.16 for -2 and +2 true coupling values respectively. 
This can be seen by enlarging the error bands in figure |^ 
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from the data, while continuing to include the detector effects in the estimators. It is also 
shown that, by using the reweighting technique, an iterative procedure can be defined 
with optimal convergence properties. 

Both the proposed fitting strategies have been demonstrated to be unbiased estimators 
when small statistical samples, of the same size as the data sets available from the 172 
GeV LEPII run, were fitted. It has been also shown that they consistently evaluate the 
estimation error. However a limitation of the Modified Observable fits became apparent 
in evaluating the error in the fitted value of the 034, coupling. Such an overestimation 
is expected in fits where the data are not sensitive to the extracted parameter and will 
disappear as the available data sets increase in size. 

A comparison of the proposed techniques with the unbinned extended likelihood in 
perfect conditions shows that the estimation accuracy achieved in this analysis is very 
close to the maximum possible. 

A Appendix A 

For A = A*^'"^ equation (0) becomes identity. Furthermore by using (|^) and approximat- 
ing g{V; A) as in ( p2D the definition of the maximum likelihood estimator relates the mean 
values of the Optimum observables {yliV; A°)/?/o(^; A") as: 

yi(^;A°)/i/o(V^;A°)-Zi(A°)/Zo(A°) ^ y{iV;\') , 

^^^?r7^ ■ [1 + il^y'^^-^ A°)/yo(^; A°) - (A°)/Zo(A°)] ■ K,]]dV = (67) 

where = \k — A° and the term y\{y; A°)/?/o(^; A°) has been added and subtracted 
in the integrand. Then, by expressing the ratio of the integrated quantities as: 



Zl{\^) _ ry{{V-\^) yo{V-X^] 



and using ( pSj) the identity ( |67D becomes : 



■dV (6^ 



0,(\/;A°)-(?(F; \)dV = 
J 0.{V- A°) ■ g{V; X')dV + ^[/ 0^{V■, A°) ■ 0,{V; A°) ■ g{V- >P)dV 



k=l 



-( / 0,{V; A°) ■ g{V; X')dV) ■ ( J 0,{V; A°) ■ g{V; X')dV)] ■ A, (69) 

which is exactly how the coupling estimators based on the mean of optimal observables 
have been defined in (^). Thus in the neighborhood of A° ( where the approximation 



(22) is valid) the maximum likelihood estimation is identical to the Optimal Observable 



estimations and obviously the latter inherits all the properties of the former. 
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If instead of the likelihood function its extension ( P^D was used then the extended 
maximum likelihood estimators will be defined for observed events as: 



E[ ]yj ]a=A-- ■ {[^( ]y] )]a=A 

d ^ 



+ /[^ E log^(^^; ^)]a=a ■ tn A)]x=x-.ci\7i ■ ■ ■ ci\/^} = (70) 
* fc=i j=i 

The above condition obviously for A = A*™^ is an identity which can be reduced, after 
some algebra, to: 

j[^.^^^9{V-M-9{V-A)]uxdV = Q (71) 



and when the p.d.f is linearly approximated as in ( p2D the same expression as (|69D is 
achieved having replaced the terms OiiV] A°) with the products /i(A) ■ Oi{V; A°). 
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OiB(t> 


Perfect Extended Likelihood 








mean of estimations 


0.006 ± 0.020 


0.01 ± 0.03 


0.11 ± 0.08 


estimation accuracy 


0.30 ± 0.02 


0.55 ± 0.03 


1.10 ± 0.05 


6c Unbinned Extended Likelihood 








mean of estimations 


-0.07 ± 0.02 


0.14 ± 0.04 


0.001 ± 0.080 


estimation accuracy 


0.35± 0.02 


0.65 ± 0.03 


1.14 ± 0.06 


pull sigma 


1.10 ± 0.05 


1.35 ± 0.06 


1.03 ± 0.05 


Binned 2 dim. Extended Likelihood 








mean of estimations 


0.01 ± 0.02 


0.007 ± 0.040 


0.07 ± 0.10 


estimation accuracy 


0.34 ± 0.02 


0.60 ± 0.04 


1.25± 0.08 


pull sigma 


0.94 ± 0.05 


1.02 ± 0.06 


0.97 ± 0.05 


Extended Modified Observable 








mean of estimations 


-0.01 ± 0.02 


0.05 ± 0.05 


0.08 ± 0.1 


estimation accuracy 


0.33 ± 0.02 


0.56 ± 0.03 


1.25± 0.06 


pull sigma 


1.05 ± 0.07 


0.95 ± 0.05 


0.90* ± 0.05 



Table 1: Comparison of the statistical properties of the techniques proposed in this paper 
with the unbinned extended likelihood estimations. (* This value represents the root mean 
squared of the pulls) 
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Figure 1: The dependence of Zi{xi,X2) on the Ci(fi)/co(ll) in three different regions 
(a) 0.00-0.12, b) 0.12-0.25 and c) 0.25-1.00) of C2(Q)/co(^^) when the awcb TGC model is 
consider for electronic final states. The horizontal error-bars correspond to the size of the 
bin, whilst the solid line denotes the region of equality between the two plotted quantities. 
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c2(a)/c0(n) 
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c2(a)/c0(n) 




c2(a)/c0(a) 



Figure 2: The dependence of Z2{xi, X2) on the C2{^)/co{^i) in three different regions (a) 
-2.00-0.020, b) -0.20-0.25 and c) 0.25-1.00) of Ci(Q)/co(j1) when the aw^ TGC model is 
consider for electronic final states. The horizontal error-bars correspond to the size of the 
bin, whilst the solid line denotes the region of equality between the two plotted quantities. 
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Figure 3: The approximated Optimum Observable ai(0; a^r^p) , corresponding to the aw<j) 
coupUng for electronic final states, as a function of the quantity yl{^; a^^)/yQ{Q; a\^^) for 
several values of Q;^,j, = a) +1 b) 0.5 )c -0.5 d) -1. The horizontal error-bars correspond 
to the size of the bin, whilst the solid line denotes the region of equality between the two 
plotted quantities. 
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Figure 4: The mean value of the aw4> Modified Observables as a function of the awip 
for three different values of the expansion point (a^^ = —2, 0, +2). The points with the 
errors correspond to the reweighted M.C. integration technique whilst the straight lines 
represents the linear equation (51). 
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Figure 5: Estimation of the aw(j> coupling based on The average of the Modified Observ- 
ables for three different expansion pints. 



23 



C/5 



1 


-1 



_ 1 1 1 1 1 


1 


1 


1 


1 


1 


1 


1 


11- 


• • . . 

• • . , 


' • • • 


' • . . 


■ • • . . 




















' • . . 


' • • • 


' • . . 


' • • • 


• . . 


- 1 1 1 1 1 


1 


1 


1 


1 


1 


1 


1 


II- 



-3 -2.8 -2.6 -2.4 -2.2 -2 -1.8 -1.6 -1.4 -1.2 ^ -1 

Expansion Point (OLy/^^) 



1 


-1 



- 1 1 1 1 1 

" + 1 , 


1 


1 


1 


1 


1 


1 


1 


II- 




^ ♦ * ♦ 




^ * * * 


























^ * * * 


^ * ♦ ♦ 






-II 


1 


1 


1 


1 


1 


1 


1 


1 1 1 1 1 1 - 



■1 -0.8 -0.6 -0.4 -0.2 0.2 0.4 0.6 0.8 ^ 1 

Expansion Point (OL^^) 



*i 1 




-1 



_ 1 1 


1 


1 


1 


1 


1 


1 


1 


1 


1 1 _ 




' • • • . 


' • . . 






















' • • . 


* • . . 






















' • • • 


• . . 


' • • . 


' • • . 


' » • , 


-II 


1 


1 


1 


1 


1 


1 


1 


1 


II- 



1.2 1.4 1.6 1.8 



2.2 2.4 2.6 2.8 ^ : 
Expansion Point (CLy/i^) 



Figure 6: Estimation of the a\y^ coupling based on the mean value of the Modified 
Observable. Three M.C sets of events produced with different couplings (a) aw4, = 
— 2.,b) = 0. and c) awct = +2.) were fitted for several values of the expansion point 
(horizontal axis). The vertical axis represents the difference of the expansion point from 
the estimated value in each fit. 
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Figure 7: Results ( a) distribution of the estimations b) pull distribution c) distribution 
of the estimated errors) of Monte Carlo experimentation concerning the estimation of the 
aw<i) coupling with binned likelihood fits. The solid lines correspond to gaussian fits. 
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cc^ Estimations Estimated cc^ Errors 



Figure 8: Results ( a) distribution of the estimations b) pull distribution c) distribution 
of the estimated errors) of Monte Carlo experimentation concerning the estimation of the 
aw coupling with binned likehhood fits. The sohd lines correspond to gaussian fits. 
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Figure 9: Results ( a) distribution of the estimations b) pull distribution c) distribution 
of the estimated errors) of Monte Carlo experimentation concerning the estimation of the 
aBcf) coupling with binned likelihood fits. The solid lines correspond to gaussian fits. 
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Figure 10: Results ( a) distribution of the estimations b) pull distribution c) distribution 
of the estimated errors) of Monte Carlo experimentation concerning the estimation of 
the aw4> coupling based on the mean value of the Modified Observables. The sohd lines 
correspond to gaussian fits. 
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Figure 11: Results ( a) distribution of the estimations b) pull distribution c) distribution 
of the estimated errors) of Monte Carlo experimentation concerning the estimation of 
the aw coupling based on the mean value of the Modified Observables. The sohd lines 
correspond to gaussian fits. 
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Figure 12: Results ( a) distribution of the estimations b) pull distribution c) distribution 
of the estimated errors) of Monte Carlo experimentation concerning the estimation of 
the aB4> couphng based on the mean value of the Modified Observables. The sohd hues 
correspond to gaussian fits. 
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